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Title: A method foi 



^mining a mimotope sequence 



The invention relates to a method for determining a 
mimotope sequence . 

Nowadays, random diversity libraries are widely used 
to identify lead molecules for diagnostics, pharmaca and 
vaccins . When the lead molecule is a peptide, the methods 
used for identifying the lead molecules are often referred to 
as pepscan methods. 

Pepscan methods have been known since the early 
eighties. The basic theory behind these methods is described 
in EP-A-0 138 855 and EP-A-0 190 205. 

Two pivotal aspects of screening of random diversity 
libraries are their size, i.e. how many different compounds 
are required to identify a lead molecule, and the method of 
optimization of the structure or sequence of the lead 
molecule. Most methods described in the literature seem to be 
based on the idea that the larger the random diversity 
library, the higher the chance of finding a good lead 
molecule . 

The aspect with respect^ to wl^ich the known methods 
are most divergent, is the method of optimization of the lead 
molecule. Once the random diversity library has been 
synthesized, it can be tested for its desired activity. Of 
course it is possible to simply choose as a lead molecule the 
member of the random diversity library that scores the 
highest value for said desired activity. However, most 
methods comprise steps for optimizing the structure of said 
member that shows the highest activity, in order to arrive at 
a lead molecule that shows an even higher activity. 

Minipepscan libraries, composed of only a few 
thousand peptides, have been used to identify lead molecules 
which were quickly optimized to molecules having an activity 
similar to that of molecules arrived at from starting from 



libraries composed of millions of compounds. This has been 
disclosed in Slootstra et al . , Molecular Diversity, 1 (1995b) 
87-96, and in Slootstra et al . , J. Mol. Recogn., 10, 217-224 
(1997). In these articles, it has been shown that small 
libraries in combination with pepscan-based optimization 
methods are a valuable tool in identifying and optimizing 
lead molecules . 

The present invention aims to provide an improved 
method for identifying and optimizing a lead molecule. 

It has been found that the objective improvement is 
achieved by starting from a library of known test molecules, 
evaluating the activity of said library, and by optimizing 
the structure of a few of test molecules showing the highest 
activity, by methodically replacing each building block of 
the structure of the test molecules with all other possible 
building blocks. 

Thus, the invention relates to a method for 
determining a mimotope sequence for a receptor comprising the 
steps of: 

a) providing a library of test sequences; 

b) determining the activity of each test sequence of the 
library towards the receptor; 

c) identifying a test sequence comprising at a certain 
position a building block which, according to the results 
of step b) , is favored at said position; 

d) providing a next library of test sequences, based on said 
test sequence identified in step c) , by replacing a 
building block at selected positions of the identified 
test sequence with selected building blocks; 

e) determining the activity of each test sequence of the 
library provided in step d) towards the receptor; 

f) identifying a test sequence comprising at a certain 
position a building block which, according to the results 
of step e), is favored at said position; 



g) repeating steps d) - f) for the library of test sequences 
provided in step d) , for a number of cycles sufficient for 
finding in step f) a mimotope sequence that gives 
sufficient activity towards the receptor. 

It has been found that the present method leads , in a 
convenient manner, to a mimotope sequence having a very high 
binding strength to the desired receptor. In some cases, 
where the exact epitope (i.e. the partial sequence of an 
antigen, that provides binding to an antibody) is known, the 
present method has been found to lead to the exact structure 
of the epitope in only a few steps . 

According to the invention, a mimotope sequence for a 
receptor is determined. In this context, a mimotope sequence 
is defined as a molecule that shows a certain, minimal, 
desired activity in the presence of a given receptor. An 
example hereof is the determination of the epitope sequence 
of an antigen for a certain antibody. However, in some cases 
it may be sufficient if the exact epitope is mimicked and 
that a slightly less binding strength of the found molecule 
to the receptor suffices. 

The mimotope sequence will be a molecule that is 
composed of a number of building blocks, wherein the number 
and order of building blocks controls the properties of the 
molecule. Examples of types of mimotope sequences that may be 
determined according to the invention include peptides, 
possibly having steroid or saccharide-like structures 
connected thereto, saccharides, DNA (oligonucleotides), and 
PNA (peptide-like nucleic acids). Thus, the building blocks 
will be chosen from the groups of amino acids (both natural 
and non-natural amino acids), monosaccharides, and 
nucleotides . 

The receptor for which a mimotope sequences is 
determined according to the invention may be any compound, 
composition, microorganism, or tissue sample towards which 



one of the types of mimotope sequences may show some activity 
that can be measured, such as a binding activity. Suitable 
examples of receptors include antibodies (both monoclonal and 
polyclonal), proteins, such as enzymes, cells, hormone 
receptors, and micro-organisms. 

The first step in a method according to the invention 
is the provision of a library of test sequences. Of course, 
these test sequences are composed of the same type or class 
of building blocks as the objective mimotope sequence. 
Preferably, the test sequences are known in that, although 
they may be randomly chosen, their composition and structure 
is known. This may be accomplished by generating, for 
instance by hand or by computer, a number of sequences and 
synthesizing the sequences thus obtained. It is also possible 
to derive the library of test sequences from a compound which 
is known to have a favorable activity towards the receptor. 
In principle, any known manner of selecting test sequences to 
make up the library is suitable. 

The length of the test sequences is dependent on the 
nature of the building blocks constituting the sequences, and 
on the nature of the receptor and the desired activity 
towards said receptor. For most purposes, a length of between 
3 and 20 building blocks will be suitable. 

The number of test sequences constituting the library 
will be large enough to provide sufficient data to come to a 
good mimotope sequence in an acceptable number of 
steps/cycles. On the other hand, said number will be small 
enough to assure that the data obtained can be handled quite 
conveniently. Usually, the number of test sequences in the 
library will lie between 500 and 100,000, preferably between 
1,000 and 10,000. 

After the desired number of test sequences is 
generated, said test sequences may be synthesized. This may 
be done in any known manner, for instance as has been 



described by Slootstra et al. in Molecular Diversity, 1 
(1995b), 87-96. Of course, it is also possible to make use of 
a library which is already available, for instance because it 
has been used in a previous run of the present method. In 
that case, the test sequences will not have to be 
synthesized . 

For reasons of convenience, the test sequences are 
preferably synthesized using a minicard or a flat support 
medium as has been described in Dutch patent application 
10 . 019703 . 

The next step of a method according to the invention 
is the determination of the activity of each test sequence of 
the library towards the receptor. The manner wherein this 
determination is carried out will depend on the specific 
interaction between mimotope sequence and receptor that is 
aimed at, and on the nature of the receptor and the building 
blocks of the mimotope sequence. For instance, when the 
desired activity is a binding of the mimotope sequence to the 
receptor, and the mimotope sequence is a peptide and the 
receptor is a monoclonal antibody, the determination may 
suitable be performed in an ELISA test, either in solution or 
on a solid support. Other suitable methods of determining the 
activity include BIACORE and AFM (Atomic Force Microscope) . 
The skilled person will be able to choose a suitable manner 
of determination of the activity, given a certain receptor 
and nature of the mimotope sequence. 

From the results of the determination of the 
activities of the test sequences of the library, at least one 
test sequence will be chosen to form a basis for the 
remaining steps of the present method. Said at least one test 
sequence is, in accordance with the invention, chosen by 
identifying at a certain position a building block which, as 
appears from the results of the determination of the 
activity, is favored at said position. In this regard, by the 
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phrase "at a certain position a building block which is 
favored at said position" is meant that test sequences having 
at said position said building block show a high activity 
towards the receptor, relative to other activities found. 
5 It is possible that the at least one test sequence 

which is identified was tested for its activity towards the 
receptor and showed itself a high activity, relative to other 
activities found. 

It is also possible that the at least one test 

10 sequence which is identified, was in itself not tested. This 
can be explained as follows. For instance, it may have been 
found that a test sequence having at position 2 building 
block A shows a very high activity towards the receptor. It 
may further have been found that a test sequence having at 

15 position 4 building block B also shows a very high activity 
towards the receptor. The at least one test sequence to be 
identified may then be a sequence comprising building block A 
at position 2 and building block B at position 4. However, it 
may very well be that this sequence itself had not yet been 

20 evaluated for its activity towards the receptor. 

Generally, the number of test sequences identified 
for basing the remaining steps of a method according to the 
invention on will be between 1 and 150. A higher number of 
test sequences chosen will lead to better results, but will 

25 be relatively more cumbersome to handle. Preferably, said 

number will be between 1 and 25. The skilled person will be 
able to decide on the number of test sequences chosen, 
dependent on the desired quality of the result of the method 
and the facilities available for carrying out the method. 

30 On the basis of the identified test sequences, a next 

library of test sequences is provided in a so-called 
replacement analysis. This is done by varying the building 
blocks at positions of the identified test sequences, i.e. by 
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replacing a building block at selected positions of the 
identified test sequence with selected building blocks. 

For instance, in its simplest form the replacement 
analysis is carried out as follows. In case the test 
5 sequences are dodecapept ides of natural amino acids, first, 
the building block at position one of one of the chosen 
dodecapeptides may changed into each of the twenty naturally 
occurring amino acids, leading to 20 new test sequences. This 
is repeated for each position of the dodecapeptide , leading 
10 to 20 * 12 = 240 test sequences. Among these 240 sequences, 
the original dodecapeptide will be present 12 times. This 
procedure is repeated for each chosen dodecapeptide. Thus, a 
next library is obtained which comprises from 240 to 3600 new 
test sequences. 

15 More complex forms of this replacement analysis may 

be performed, for instance by allowing certain building 
blocks to be replaced by groups of building blocks (e.g. the 
replacement of one amino acid by two or more amino acids) . 
Usually, such a replacement by multiple building blocks will 

20 be used sparingly. Preferably, in at least one replacement 
analysis of the different cycles of the present method, at 
least one building block is replaced by a group of building 
blocks . Care should be taken that in each replacement 
analysis not too many multiple building blocks are 

25 introduced, for then problems concerning the length of the 
sequence may arise. For the same reason, the multiple group 
of building blocks should preferably not be too long. The 
skilled person will, based on his experience in the field, be 
able to judge which size and number of multiple building 

30 blocks may be introduced. 

It is also possible to replace a building block by a 
void in the replacement analysis. That way, the sequence will 
become one building block shorter. It will be clear that not 
too many building blocks should be replaced by a void at the 
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same time, as this could lead to an undesirably short (or no) 
test sequence. The skilled person will be able to judge when 
and at which position it is helpful to introduce a void into 
a test sequence. 

In addition, it is possible to allow the introduction 
of selected building blocks in the replacement analysis only, 
or to only replace building blocks at selected positions in 
the test sequences. Sometimes it is preferred to maintain 
certain building blocks at certain positions, e.g. in order 
to maintain a desired three dimensional structure of the test 
sequence. An advantage of this manner of performing the 
replacement analysis is that the total number of test 
sequences to be provided is limited. Generally, however, if 
the building blocks at all positions of the test sequence are 
replaced by all possible building blocks, the test sequences 
having the desired three dimensional structure will also be 
provided. Again, it is within the skill of the artisan to 
judge in which cases it may be helpful to carry out a 
replacement at selected positions only. He will also be able 
to judge to which positions this may apply, and which 
building blocks may be used for the replacement and which 
building blocks may best not be used in the replacement (at 
certain positions) . 

The test sequences of the next library, which 
provided on the basis of the replacement analysis of the at 
least one identified test sequence, are tested for their 
activity towards the receptor. In case a test sequence is 
present among the test sequences of this library that shows 
sufficient activity towards the receptor, the method is 
completed and the desired mimotope sequence is obtained. It 
will be clear that it will depend on the type of activity, 
the receptor, the mimotope sequence, and the objective 
application of the mimotope sequence, whether a certain 
activity is regarded sufficient. Given the circumstances, the 



skilled person will be able to choose a threshold for a 
desired activity. 

In case none of the test sequences shows sufficient 
activity towards the receptor, another cycle is carried out. 
In accordance with the invention, preferably at least two 
cycles, and thus two replacement analyses, are carried out. 
Thus, a next library of test sequences is provided, based on 
the 10 to 15 test sequences of the previous library that 
showed the highest activity, as set forth above. The test 
sequences in this next library will be evaluated for their 
activity, and so on. 

The number of cycles that has to be carried out will 
depend on the desired degree of activity of the mimotope 
sequence to be ' found for the receptor. Starting from about 
4500 randomly chosen dodecapeptides , it has been found 
possible to arrive at the exact epitope sequence for a 
monoclonal antibody in less than three cycles. 

In a preferred embodiment of the invention, the 
amount of receptor used for determining the activity of the 
test sequences towards said receptor, is lowered in each 
cycle. It has been found that building blocks that seem 
essential in the first cycle may become non-essential in 
further cycles. In other words, in said first cycle local 
optima may be found, which may be overcome in a subsequent 
cycle or in subsequent cycles. The lowering of the amount of 
the receptor in the determining of the activity of the test 
sequences has been found to assist in overcoming such local 
optima. Preferably, the amount of receptor used in the 
determination of the activities of test sequences in a 
library is reduced by a factor of from 50 to 1000, more 
preferably from 10 to 100. 

The invention will now be further elucidated by the 
following, non-restrictive examples . 
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EXAMPLES 



The described examples illustrate six variations of 
the present optimization method that has been successfully 
5 applied to lead peptides, identified with antibodies 
(monoclonal and polyclonal) raised against peptides, 
proteins, viruses, bacteria, sugars and steroids, from 
various types of peptide libraries. It can be envisaged that 
additional variations are possible. The concept that rules 
10 these variations is the repetition of replacement analyses on 
1 or more lead peptides. Local minima are overcome and 
epitopes and mimotopes are identified. 
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Materials and Methods 

Synthesis and screening minicards libraries (used in examples 
1, 2, 5, 6) 

5 Using the 20 natural L-amino acids 4550 random 

dodecapeptides (12-mers) were generated with a random 
generator programmed in Quick basic which runs on a 486 DX2 
(66 MHz) computer system. In this library the frequency of 
each residue is approximately 5%. This set of sequences was 

10 used to design additional libraries. One in which all 

sequences are in the D-amino acid form, one in which the 2nd 
and 11th position are held by a cysteine and one in which the 
3rd and 10th position are held by a cysteine. The aim of the 
latter two is to introduce a disulphide bridge into the 

15 peptide which should present the peptides as loops. Very 

important here is that the sequence of the amino acids within 
and outside the cysteines is identical to that of the first 
set of 4550 dodecapeptides. In this way the activity of these 
sets of dodecapeptides can be compared. Example-1 describes 

20 results obtained with the synthetic peptide library one in 
which the 3rd and 10th position are held by a cysteine. 
Example-2 describes results obtained with the synthetic 
peptide library composed of 4550 random dodecapeptides (12- 
mers), i.e. no cysteine or any other motif was used. 

25 The libraries were synthesized and screened using 

credit-card format mini-PEPSCAN cards (455 peptides /card) as 
described previously (Slootstra et al., 1995b). In example-1 
the binding of monoclonal antibody 26/9 to each peptide was 
tested in a PEPSCAN-based enzyme-linked immuno assay (ELISA) . 

30 Monoclonal antibody 26/9 has been described previously 

(Wilson et al., 1984; Rini et al . , 1992; Churchill et al., 
1994). Monoclonal antibody 26/9 was raised against the 



( 



12 



peptide HA175-110 of the hemagglutinin protein of influenza 
virus (X47:HA1) (Wilson et al . , 1984). 

In example-1 the 455-well creditcard- format 
polyethylene cards, containing the covalently linked 
5 peptides, were incubated with antibody 26/9 (100 pg/ml). 

After washing the peptides were incubated with rabbit-anti- 
mouse peroxidase (rampo, dilution 1/1000) (Dakopatts) (1 hr, 
25oC) , and subsequently, after washing the peroxidase 
substrate 2 , 2 ' -azino-di-3-ethylbenz thiazoline sulfonate 

10 (ABTS) and 2 ul/ml 3% H202 were added. After 1 hr the color 
development was measured. The color development of the ELISA 
was quantified with a CCD-camera and an image processing 
system. The setup consists of a CCD-camera and a 55 mm lens 
(Sony CCD Video Camara XC-77RR, Nikon micro-nikkor 55 mm 

15 f/2.8 lens), a camara adaptor (Sony Camara adaptor DC-77RR) 
and the Image Processing Software package TIM, version 3.36 
(Difa Measuring Systems, The Netherlands) . TIM runs on a 486 
DX2 (50 MHz) computer system. 

The dodecapeptide library, composed of 4550 random 

20 dodecapeptides , is screened with relatively high 

concentration of antibody (100 ug/ml, more usually 10 ug/ml, 
see examples 2-6). This concentration is approximately 2 
orders of magnitude above the concentration required to 
obtain maximal binding activity in ELISA with the native 

25 epitope peptide (not shown) . The concentration of 100 ug/ml 
was used to obtain many binding peptides from the small set 
of 4550 random dodecapeptides. In previous studies it was 
shown that such a strategy, without obtaining an unfavorable 
signal to noise ratio, can result in hundreds of binding 

30 peptides that resemble small linear or non-linear parts of 
the native epitope (Slootstra et al., 1995b; Slootstra et 
al . , 1997) . 
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Synthesis and screening of OTHER pepscan libraries (used in 
example 4 ) 

In addition to random minipepscan libraries mimotopes 
can also be identified from standard pepscan libraries. These 
5 libraries contain all overlapping 12-mers (or shorter /longer ) 

covering the linear sequence of a known protein (Slootstra et \S 
al . , 1995a) . 
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Synthesis and screening of non-pepscan libraries (used in 
examples 3, 6) 

Lead peptides can also be derived from other type of 
libraries such as for example phage-display libraries. 

5 

Sequence analysis of lead molecules (used in example-5) 

All 4550 dodecapeptides were ranked according to 
their binding activity. According to their ranking consensus 
sequences and motifs are identified. Initial methods of lead 

10 optimization that only use the sequence of the top 50 

molecules have been described in Slootstra et al. (1995b; 
1997). The present method uses all 4550 sequences. Firstly, 
the frequency and/or distribution of single amino acids and 
the dipeptide motifs 00, dipeptide motifs 0X0 and dipeptide 

15 motifs OXXO is determined using Microsoft Excel 4.0 (0, one 
of the 20 natural L-amino acids; X, any residue) . Secondly, 
properties of the amino acids (hydrophobicity , charge etc.) 
are included in the analysis. Thirdly, all this data is used 
to optimize the activity of the top 10-50 lead molecules. 

20 This is done by substitution of building blocks that inhibit 
activity and include building blocks that improve binding 
activity. 

The present sequence analysis method improves the 
activity of lead molecules through motif analysis of all 
25 sequences part of the library. 

Replacement analyses (used in examples 1, 2, 3, 4, 5, 6) 

In a replacement analysis the binding activity of 
complete series of substitutions analogs of lead peptides, in 
30 which each position is replaced by each of the other 19 
natural L-amino acids, is investigated in detail. In 
addition, it is possible to use the 20 D-amino acids as well 
as other available non-natural amino acids. Substitutions 
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that result in improved binding are combined into new 
sequences of building blocks. These new sequences are again 
tested in a 2nd replacement analysis. If necessary more 
rounds of replacement analyses are performed. After each 
5 round of a replacement analyses the antibody (or other 
soluble receptor) concentration can be lowered to obtain 
maximal binding activity (e.g. from 100 ug/ml to 0.01 jag/ml 
after 2 rounds of replacement-analyses, as shown in example- 
1) . 

10 
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Notes for the examples: 

- the 10 pg/ml, 1 jag/ml etc. denote the antibody 
concentration . 

- The line "improved position-01" etc. means that 
this change in amino acid (is underlined) improves binding 
activity in pepscan elisa at given antibody concentration. 
Positions that could not be improved are not mentioned (e.g. 
position-08 in first replacement-net of example-1, i.e. the 
original I at position 8 gives the highest activity) . 



Example-1 : 

title: Identification of an epitope through a lead peptide 
selected from 4550 random dodecapeptides (minipepscan library 
with motif XXC XXXXX XCXX, X , randomly selected amino acid, 
r^cfTl^aterials and Method) )_^0 



tools: lead peptide (SEQ ID NO: 1 ) GCGAAMNIRCYA 
methodology: two rounds of replacement analyses 
Results : 

In example-1, the lead peptide (SEQ ID NO: 
2) CGCAAMNIRCYA was derived (with antibody 26/9) from the 
random library in which the 3rd and 10th position are held by 
a cysteine (see above in Materials and methods) . All possible 
single substitution analogs of the random peptide (SEQ ID NO: 
2) CGCAAMNIRCYA tested in pepscan for binding antibody 26/9 at 
100 pg/ml. Below these results are described in detail. 

The replacement analysis of the lead peptide (SEQ ID 
NO: 2) CGCAAMNIRCYA resulted in the identification of building 
blocks that cannot be replaced by any other building blocks 
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(e.g. Y and A) , and to the identification of residues that 
can be replaced by one or two other building blocks (e.g. N) , 
and to residues that can be replaced by many other building 
blocks (e.g. M) . Some replacements improve binding activity 
5 considerably (e.g. N into D) . 

All these replacements were used to design the 
improved peptide (SEQ ID NO: 3 ) EMDEEEDIMNYA . Note that this 
peptide binds antibody 26/9 at much lower concentration, i.e. 
has improved binding activity (100 pg/ml for (SEQ ID NO: 
10 2) CGCAAMNIRCYA and 1.0 pg/ml for (SEQ ID NO: 3 ) EMDEEEDIMNYA) . 

he peptide (SEQ ID NO: 3) EMDEEEDIMNYA was run 
through a\second replacement analysis. Again some 
replacement^ improve binding activity considerably. These 
replacements Vere used to .design the improved peptide (SEQ ID 
NO: 4) EMDEEEDVBDYA. Essential is that the first part of (SEQ 
ID NO: 3) EMDEEEDIMNYA, (portion of SEQ ID NO: 3 ) EMDEEE , does 
noy contain critical residues whereas the latter part, 
\^ (/ortion of SEQ ID NO: 3 ) DIMNYA, does. Combination of the 

mproved residues in \his latter part results in the sequence 
(portion of SEQ ID NO:\)DVPDYA. The sequence (portion of SEQ 
ID NO: 4 ) DVPDYA is identical to the linear epitope of 
antibody 26/9. Thus, the l^Sad peptide (SEQ ID NO: 
2 ) CGCAAMNIRCYA derived from X f ew thousand random 
dodecapeptides was turned intoXnative epitope sequence 
25 through two replacement analyses 




REPLACEMENT ANALYSIS-I 

original lead: (SEQ ID NO: 2 ) CGCAAMNIRCYA 

activity at > 100.0 pg/ml 
30 improved position-01: (SEQ ID NO: 5 ) EGCAAMNIRCYA 

activity at > 10.0 pg/ml 

improved position-02: (SEQ ID NO: 6) CMCAAMNIRCYA 

activity at > 10.0 pg/ml 



improved position- 03 : (SEQ ID NO: 7 ) CGDAAMNIRCYA 

activity at > 10.0 pg/ml 
improved position-04: (SEQ ID NO: 8 ) CGCEAMNIRCYA 

activity at > 10.0 pg/ml 
improved position-05: (SEQ ID NO: 9) CGCAEMNIRCYA 

activity at > 10.0 pg/ml 
improved position-06: (SEQ ID NO: 10 ) CGCAAENIRCYA 

activity at > 10.0 pg/ml 
improved position-07: (SEQ ID NO: 11) CGCAAMDIRCYA 

activity at > 10.0 pg/ml 
improved position-09: (SEQ ID NO: 12 ) CGCAAMNIMCYA 

activity at > 10.0 pg/ml 
improved position-10: (SEQ ID NO: 13 ) CGCAAMNIRNYA 

activity at > 10.0 pg/ml 
combination improvements: (SEQ ID NO: 3) EMDEEED I MN YA 

activity at > 1 . 0 pg/ml 



REPLACEMENT ANALYSIS-II 
combinations rep-an. I: 

improved position-08 : 

improved position-09 : 

improved position-10 : 

combination improvements 



(SEQ ID NO: 3 ) EMDEEEDIMNYA 

activity at > 1 . 0 pg/ml 

(SEQ ID NO: 14 ) EMDEEEDVMNYA 

activity at > 0.1 pg/ml 

(SEQ ID NO: 1 5 ) EMDEEEDI PNYA 

activity at > 0.1 pg/ml 

(SEQ ID ; NO: 1 6 ) EMDEEEDIMDYA 

activity at > 0.1 pg/ml 
: (SEQ ID NO: 17 ) EMDEEED VPD YA 

activity at > 0.01 pg/ml 



Notes : 

Th^ sequence (portion of SEQ ID NO: 4)DVPDYA is the 
original epAtope . The peptide (SEQ ID NO: 4 ) EMDEEEDVPDYA has 
0-fold improved binding affinity (in solution) over the 
tive epitopeX peptide (SEQ ID NO: 18 ) YPYDVPDYASLRS . 
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Example-2 : 

5 title: Identification of a mimotope through a lead peptide 
selected from 4550 random dodecapeptides (random minipepscan 
library) 

tools: lead peptide (SEQ ID NO: 1 9 ) ANWPSAIGAFGL 

10 

methodology: three rounds of replacement analyses 
Results : 



15 random minipepscan library. The difference of example-2 with 
example-1 is that the antibody used in example-1 binds a 
linear epitope whereas the antibody used in example-2 binds a 
non-linear epitope. The following multiple replacement 
analyses were done as discussed in example-1. The identified 

20 mimotope does not resemble any region of the linear sequence 
of the native protein. 

REPLACEMENT ANALYSIS-I 

original lead: (SEQ ID NO: 1 9 ) ANWPSAIGAFGL 

25 activity at > 10.0 ug/ml 

improved position-01: (SEQ ID NO: 2 0 ) HNWPSAIGAFGL 



In example-2, lead peptides were identified through a 
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improved position-03 : 



improved position-02 : 



activity at > 5.0 ug/ml 
(SEQ ID NO: 2 1 ) AWWPSAIGAFGL 

activity at > 5.0 ug/ml 
(SEQ ID NO: 22 ) ANAPSAIGAFGL 

activity at > 5 . 0 ug/ml 



improved position-04 : 



(SEQ ID NO: 23 ) ANWSSAIGAFGL 



activity at > 5.0 ug/ml 



( 

improved position-11 : 
combination improvements 

5 

REPLACEMENT ANALYSIS-II 
combinations rep-an. I: 

improved position-01: 

10 

improved position-02 : 
improved position-03 : 
15 improved position-07: 

combination improvements 

20 

REPLACEMENT ANALYSIS-III 
combinations rep-an. II: 

improved position-03 : 

25 

improved position-06 : 
combination improvements 

30 
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(SEQ ID NO: 2 4 ) ANWSSAIGAFKL 
activity at > 5.0 jig /ml 
: (SEQ ID NO: 25) HWAS SAIGAFKL 
activity at > 1 . 0 pg/ml 

(SEQ ID NO: 2 5 ) HWASSAIGAFKL 
activity at > 1 . 0 pg/ml 

(SEQ ID NO: 2 6 ) KWASSAIGAFKL 
activity at > 0.5 pg/ml 

(SEQ ID NO: 2 7 ) HYASSAIGAFKL 
activity at > 0 . 5 pg/ml 

(SEQ ID NO: 2 8 ) HWGSSAIGAFKL 
activity at > 0.5 pg/ml 

(SEQ ID NO: 2 9 ) HWASSAMGAFKL 
activity at > 0.5 pg/ml 
: (SEQ ID NO: 30) KYG SSAMGAFKL 
activity at > 0 . 1 pg/ml 



(SEQ ID NO: 30 ) KYGSSAMGAFKL 

activity at > 0.1 pg/ml 

(SEQ ID NO: 3 1 ) KYFSSAMGAFKL 

activity at > 0.05 pg/ml 

(SEQ ID NO: 32 ) KYGSSGMGAFKL 

activity at > 0,05 pg/ml 
: (SEQ ID NO: 33) KYFSSGMGAFKL 

activity at > 0.01 pg/ml 



Example -3 : 



t 
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title: Identification of a mimotope through a lead derived 
from phage-display library composed of >1000.000 random 
hexapeptides . 

5 tools: lead peptide (SEQ ID NO: 34 ) SDTRKG* 

methodology: lengthening to the left and to the right of (SEQ 
ID NO: 34) SDTRKG with cysteines/glycines, followed by three 
rounds of replacement-analyses 

10 

Results: 

In example-3, lead peptides were identified through 
phage-display. Adjacent cysteines /glycines were added to 
improve binding activity. After obtained activity multiple 
15 replacement analyses were done as discussed in example-1 . 



23 

LENGTHENING BY CYSTEINES /GLYCINES : 
phage-display lead: SDTRKG (SEQ ID NO: 34) 

NO activity at 10.0 pg/ml 
lengthened lead: CSDTRKGC (SEQ ID NO: 35) activity at 

5 > 10.0 pg/ml 

lengthened lead: CSDTRKGCG (SEQ ID NO: 36) 

activity at > 10.0 pg/ml 



REPLACEMENT ANALYSIS-I 
10 lengthened lead: 

improved position-02 : 

improved position-03 : 

15 

improved position-05: 
improved position-06 : 
20 combination improv. : 

REPLACEMENT ANALYSIS-II 
comb, rep-an. I: 

25 

improved position-02 : 
improved position-05: 
30 improved position-06: 
combination improv. : 



CSDTRKGCG (SEQ ID NO: 36) 
activity at > 10.0 jig /ml 
CTDTRKGCG (SEQ ID NO: 37) 
activity at > 5.0 jag /ml 
CSETRKGCG (SEQ ID NO: 38) 
activity at > 5 . 0 pg/ml 
CSDTHKGCG (SEQ ID NO: 39) 
activity at > 5 . 0 pg/ml 
CSDTRYGCG (SEQ ID NO: 40) 
activity at > 5.0 pg/ml 
CTETHYGCG ( SEQ ID NO: 41) 
activity at > 1 . 0 pg/ml 

CTETHYGCG (SEQ ID NO : 4 1 ) 
activity at > 1.0 ^g/ml 
CYETHYGCG (SEQ ID NO: 42) 
activity at > 0 . 5 pg/ml 
CTETKYGCG (SEQ ID NO: 43) 
activity at > 0.5 pg/ml 
CTETHFGCG (SEQ ID NO: 44) 
activity at > 0.5 pg/ml 
CYETKFGCG (SEQ ID NO: 45) 
activity at > 0 . 1 pg/ml 
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REPLACEMENT ANALYSIS- 



III 



comb, rep-an. II: 



CYETKFGCG ( SEQ ID NO: 45) 



5 



combination improv. : 



improved position-01 : 



improved position-08 : 



activity at > 0 . 1 ug/ml 
DYETKFGCG (SEQ ID NO: 4 6) 
activity at > 0.05 ug/ml 
CYETKFGNG (SEQ ID NO: 47) 
activity at > 0.05 pg/ml 
DYETKFGNG (SEQ ID NO: 48) 
activity at > 0.01 pg/ml 
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*, this lead peptide is, ' as a synthetic peptide, not active 
(not in elisa (pepscan or standard nor in solution) . This is 
not unique. Often phage-peptides are only active as part of 
the phage-coat protein. In other formats they lose their 
15 activity. 

Example- 4 : 

20 title: Identification of a mimotope through a lead derived 
from a standard pepscan analysis. 

tools: lead peptide RVMIKLILVNFR* (SEQ ID NO: 49) and complete 
sequence of native protein {part which was used is 
25 KIYRVMIKLILVNFRMQP (SEQ ID NO: 50)). 

methodology: Two rounds of replacement-analyses, followed by 
lengthening to the left and right, again followed by two more 
rounds of replacement-analyses, finally followed by 
30 lenthening to an 18-mer mimotope. 



Results : 



/ 



25 

In example-4, a lead peptide was identified through 
standard pepscan analysis, i.e. the antibody was tested on 
all overlapping 12-mers covering the linear sequence of the 
protein. The following two rounds of replacement analyses 
5 were done as discussed in example-1. In addition, the 

mimotope was lengthened to the left and right (3 amino acids, 
this can be further lengthened) through additional 
replacement analyses . 

10 REPLACEMENT ANALYSIS-I 



original 


lead : 


RVMIKLILVNFR 


(SEQ 


ID NO: 


49) 






activity at > 


10. C 


) ug/ml 




improved 


position-01 : 


AVMIKLILVNFR 


(SEQ 


ID NO: 


50) 






activity at > 


5 . 0 


lig/ml 




improved 


position-02 : 


AIM I K L I L VN FR 


(SEQ 


ID NO: 


52) 






activity at > 


5 . 0 


ug/ml 




improved 


position-03 : 


AVPIKLILVNFR 


(SEQ 


ID NO: 


53) 






activity at > 


5 . 0 


ug/ml 




improved 


position-08 : 


AVMIKLIRVNFR 


(SEQ 


ID NO: 


54 ) 






activity at > 


5.0 


ug/ml 




improved 


position- 1 1 : 


AVMIKLILVNYR 


(SEQ 


ID NO: 


55) 






activity at > 


5.0 


ug/ml 




combination improv. : 


AIPIKLIRVNYR 


(SEQ 


ID NO: 


56) 






activity at > 


1.0 


pg/ml 




REPLACEMENT ANALYSIS-II 










comb, rep-an. I: 


AIPIKLIRVNYR 


(SEQ 


ID NO: 


56) 






activity at > 


1.0 


ug/ml 




improved 


position-01 : 


YIPIKLIRVNYR 


(SEQ 


ID NO: 


57) 






activity at > 


0.5 


ug/ml 




improved 


position-02 : 


APPIKLIRVNYR 


(SEQ 


ID NO: 


58) 






activity at > 


0.5 


ug/ml 




combination improv. : 


YPPIKLIRVNYR 


(SEQ 


ID NO: 


59) 






activity at > 


0.1 


Ug/ml 
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LENTHENING WITH NATIVE SEQUENCE: 

comb, rep-an. II: YPPIKLIRVNYR (SEQ ID NO: 59) 

activity at > 0 . 1 pg/ml 
5 lengthened left: KIYYPPIKLIRV (SEQ ID NO: 60) 

activity at > 1.0 \ig/m± 

lengthened right: IKLIRVNYRMQP (SEQ ID NO: 61) 

activity at > 1.0 pg/ml 



10 REPLACEMENT ANALYSIS-III 
lengthened left: 

improved posit-01 : 

15 improved posit-02: 

improved posit-03 : 

improved posit-08 : 

20 

combination impr . : 



REPLACEMENT ANALYSIS-IV 
25 lengthened right: 

improved posit-10: 

improved posit-11 : 

30 

improved posit-12 : 
combination impr. : 



of left lengthened peptide: 

KIYYPPIKLIRV (SEQ ID NO: 60) 
activity at > 1 . 0 pg/ml 

RIYYPPIKLIRV (SEQ ID NO: 62) 
activity at > 0.5 pg/ml 

KPYYPPIKLIRV (SEQ ID NO: 63) 
activity. at > 0.5 pg/ml 

KIWYPPIKLIRV (SEQ ID NO: 64) 
activity at > 0.5 ]ag/ml 

KIYYPPISLIRV (SEQ ID NO: 65) 
activity at > 0 . 5 |ig/ml 

RPWYPPISLIRV (SEQ ID NO: 66) 
activity at > 0 . 1 pg/ml 

of right lengthened peptide: 

IKLIRVNYRMQP (SEQ ID NO: 61) 
activity at > 1 . 0 iag/ml 

I KLI RVN YRCQP (SEQ ID NO: 67) 
activity at > 0.5 pg/ml 

IKLIRVNYRMEP (SEQ I D NO : 68) 
activity at > 0 . 5 ^g/ml 

I KL I RVN YRMQN (SEQ ID NO: 69) 
activity at > 0 . 5 pg/ml 

IKLIRVNYR CEN (SEQ ID NO: 70) 
activity at > 0 . 1 \iq/ml 



27 



combination total: RPWYPPISLIRVNYRCEN (SEQ ID NO: 71) 

activity at > 0.01 ug/ml 

5 *, lead peptide was identified from a library composed of all 
overlapping 12-mers covering the linear sequence of a 
protein. This makes it possible to lengthen the peptide to 
the left and right with adjacent amino acids, in this case 
KIY on the left and MQP on the right. Additional replacement- 
10 analyses of 12-mers, shifted three to the left or three to 
the right of the sequence finally results in an 18-mer 
mimotope . 

15 Example-5: 

title: Identification of a mimotope through a set of similar 
leads selected from 4550 random dodecapeptides (random 
minipepscan library) 

20 

tools: set of 6 lead peptides QNNMKLFRGCVP " (SEQ ID NO: 72), 
RGIKWNEMTDQW (SEQ ID NO: 73), KLQQN PT FY P P V (SEQ ID NO: 74), 
TNNCKEFAGIVP (SEQ ID NO: 75), RGILTNIMKDQW (SEQ ID NO: 76), 
IVQNNPKFFRGA (SEQ ID NO: 77) (potentially up to all 4550 
25 peptides, see material and method, in this case also 
'negative 1 amino acids are removed from the leads) 

methodology: : determination of consensus sequence, followed 
by two rounds of replacement analyses of the consensus 
30 sequence 



Results : 



28 



In example-5, a set of similar lead peptides were 
used to identify a consensus sequence. The consensus 
sequences were identified as discussed in materials and 
method. This sequence was used in the replacement analyses. 

5 

ALIGNMENT OF LEAD PEPTIDES: 



10 



QNNM KL FRG CVP ( SEQ ID NO: 72) 
RGIKWNEMTDQW (SEQ ID NO: 73) 
KLQQNPTFYPPV (SEQ ID NO: 74) 
TNNCKEFAGIVP (SEQ ID NO: 75) 
RGILTNIMKDQW (SEQ ID NO: 76) 
IVQNNPKFFRGA (SEQ ID NO: 77) 



15 



consensus 



ILQNNMKDFRG (SEQ ID NO: 78) 



REPLACEMENT ANALYSIS-I 
consensus lead: 

20 improved position-03: 

improved position-07 : 



25 



improved position-10 



combination improv . 



ILQNNMKDFRG (SEQ ID NO: 78) 
activity at > 1.0 ug/ml 

ILTNNMKDFRG (SEQ ID NO: 79) 
activity at > 0.5 pg/ml 

ILQNNMPDFRG (SEQ ID NO: 80) 
activity at > 0.5 ug/ml 

I LQNNMKDWRG (SEQ ID NO: 81) 
activity at > 0 . 5 ug/ml 

ILTNNMPDWRG (SEQ ID NO: 82) 
activity at > 0,1 ug/ml 



REPLACEMENT ANALYSIS-II 
30 comb, rep-an. I: 

improved position-07 : 

improved position-11 : 



ILTNNMPDWRG (SEQ ID NO: 82) 
activity at > 0.1 ug/ml 

I LTNNMGDWRG (SEQ ID NO: 83) 
activity at > 0.05 ug/ml 

ILTNNMPDWYG (SEQ ID NO: 84) 
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combination irnprov . 



activity at > 0.05 ug/ml 
I LTNNMGDW YG (SEQ ID NO: 85) 
activity at > 0.01 jag /ml 



Example- 6 : 

title: Identification of a mimotope through a set of 
different leads selected from phage-display library composed 
10 of >1, 000, 000 random hexapeptides . 

tools: set of 3 lead peptides (ANWPSA (SEQ ID NO: 86), KLITRW 
(SEQ ID NO: 87), NVCSWS (SEQ ID NO: 88)) 



15 methodology: Two rounds of replacement analyses (for each 

lead peptide) , followed by determination of overall consensus 
sequence . 
Results : 

In example-6, different lead peptides were used to 
20 identify a consensus sequence. Each lead peptide was used in 
multiple replacement analyses (two rounds) . The resulting 
three mimotopes were aligned which resulted in a lengthened 
mimotope with improved activity. 
REPLACEMENT ANALYSIS- I A 
SE£\ID NOS 86, 89-93, respectively, in order of appearance) 




30 




^riginkl lead : 


ANWPSA 


: activity 


at 


>10. 


0 


ug/ml 


improvea\position-01 : 


HNWPSA 


: activity 


at 


> 


5. 


0 


ug/ml 


improved p^sition-02 : 


AWWPSA 


: activity 


at 


> 


5. 


0 


ug/ml 


improved pos^iion-03 : 


ANAPSA 


: activity 


at 


> 


5. 


0 


ug/ml 


improved positften-04 : 


ANWSSA 


: activity 


at 


> 


5. 


0 


ug/ml 


combination impro\. : 


HWASSA 


: activity 


at 


> 


1 . 


0 


ug/ml 


REPLACEMENT ANALYSIS- 


IIA 















2 



(JSEQ ]>D NOS 93, 94, 94, respectively, in order of appearance) 
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omb. \ rep-an. IA 
improved posit ion- 05 
combination riQprov. 



HWASSA : activity at > 1 . 0 tag /ml 
HWASPA : activity at > 0.5 pg/ml 
HWASPA : activity at > 0.5 pg/ml 
PLACEMENT ANALYSIS-IB 
(S\Q ID NOS 87, 95-99, respectively, in order of appearance) 
original lead 

mprovied posit ion- 01 
improve^ position- 02 
improved ^sition-03 
improved po^it ion-0 6 
combination llnprov 



KLITRW 


: activity 


at 


>10 


.0 


pg/ml 


SLITRW 


: activity 


at 


> 5 


.0 


pg/ml 


KSITRW 


: activity 


at 


> 5 


.0 


pg/ml 


KLATRW 


: activity 


at 


> 5 


. 0 


pg/ml 


KLITRY 


: activity 


at 


> 5 


. 0 


pg/ml 


SSATRY 


: activity 


at 


> 1 


. 0 


pg/ml 



REPLACEMENT ANALYSIS- I IB 

(Se\ ID NOS 99, 100, 100, respectively, in order of 
ppe^rance ) 



comb. \ rep-an. IB 
improvec\ position- 02 
combination improv 



SSATRY 
SPATRY 
SPATRY 



activity at > 1 . 0 
activity at > 
activity at > 



pg/ml 
0 . 5 pg/ml 
0.5 pg/ml 




REPLACEMENT ANALYSIS-IC 

(SEQ\ID NOS 88, 101-104, respectively, in order of 
appearance ) 
cjrigina\ lead 

mproved \osition-02 
improved position- 0 4 
improved po^^tion-06 
combination improv . 



NVCSWS 


activity 


at 


>] 


L0 


0 


pg/ml 


N^CSWS 


activity 


at 


> 


5 


0 


pg/ml 


NVCHWS 


activity 


at 


> 


5 


0 


pg/ml 


NVCSWA 


activity 


at 


> 


5 


0 


pg/ml 


N^CHWA 


activity 


at 


> 


1 


0 


pg/ml 



REPLACEMENT ANALYSIS-IIC 

(SEQ 1$ NOS 104-107, respectively, in order of appearance) 



comb. \rep-an. IC 
nproved\ posit ion- 01 
mproved jjosition-02 
combination improv 



NICHWA 
YICHWA 
NVCHWA 
YVCHWA 



activity at > 1 . 0 pg/ml 
activity at > 0.5 pg/ml 
activity at > 0.5 pg/ml 
activity at > 0 .1 pg/ml 



i 
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ALIGNMEt 



combination improv. IIA, IIB and IIC; 



(SEQ ID NOS\94, 100, 107-108 respectively, in order of 



appearance 
combination improvements 1 
combination improvements 2 
combination improvement sX3 



consensus 



HWASPA 

SPATRY 

YVCHWA 

sYVCHWASSATRY 

activity at 0.01 ug/ml 



